Main
Rich Pauloo, PhD
I spend most of my days writing code (mostly R, Python, SQL) to clean, visualize, and model data. I have a PhD in computational hydrogeology, where I simulated and visualized 3D contaminant transport in aquifers.
I’m an exert-level #rstats user. A few projects I’m proud of include R packages to query water quality data 📦 and text yourself from R 📱, R data science curriculum 📚, a dashboard that makes millions of water quality observations understandable 📈, and a model that predicts the risk of wells going dry 💧 funded by Microsoft’s AI for Earth Grant.
Education
PhD, Computational Hydrogeology
University of California Davis
Davis, CA
2020 - 2015
- Published 6 scientific papers (3 first-author).
- Tools used: R, Python, SQL, git/Github, bash, AWS, cron, dplyr, ggplot2, shiny, flexdashboard, leaflet, sf, MODFLOW, RW3D, Paraview, Illustrator, ArcGIS, Envi, LaTeX
- Won ~$153,000 in national, compeitive grants and awards from NASA, Microsoft AI for Earth, AGU, and others.
B.S., Integrative Biology (minor in Conflict Resolution)
University of California Berkeley
Berkeley, CA
2011 - 2006
Professional & Research Experience
Data Scientist
Larry Walker Associates
Berkeley, CA
present - 2020
- Built and automated ETL pipelines for ~180 real-time sensor networks and dashboards that process > 100,000 daily observations.
- Turned messy data into actionable information and automated reports.
- Tools used: R, Python, SQL, git/Github, bash, AWS, cron, dplyr, ggplot2, shiny, flexdashboard, leaflet, sf
- Managed multiple six-figure contracts, scoped work, contributed to strategic marketing, and trained staff.
Co-Founder
Water Data Lab
Remote
present - 2020
- Manage $105k in annual contracts for specialized data science consulting.
- Co-developed r4wrds.com
Data Engineer
UC Water
Davis, CA
2020 - 2018
- Developed a monitoring dashboard with interactive data visualization using AWS with R, SQL, Shiny, and Shiny Server. I also built an automated ETL pipeline that pulled data from an IoT sensor network to feed the dashboard. REsults were peer-reviewed and published.
Data Lab Researcher
Computational Institute for Geodynamics (CIG)
UC Davis
2019 - 2018
- NLP, text mining, and network analysis in R on a corpus of ~600 PDFs.
- Developed a R Shiny dashboard to understand the corpus.
- Results were peer-reviewed and published.